Distant annotation of Chinese tense and modality
Abstract
In this paper we describe a “distant annotation” method by which we mark up the tense and modality of Chinese eventualities via a word-aligned parallel corpus. We first map Chinese verbs to their English counterparts via word alignment, and then annotate the resulting English text spans with coarse-grained tense and modality categories that we believe apply to both English and Chinese. Because English has richer morpho-syntactic indicators of tense and modality than Chinese, we expect this distant annotation approach to yield more consistent annotation than annotating the Chinese side directly. We report experimental results showing that this expectation is largely borne out.
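The two steps described in the abstract (map each Chinese verb to its aligned English span, then carry the label annotated on that span back to the verb) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function names `aligned_spans` and `project_labels`, the alignment format (pairs of Chinese and English token indices), the example sentence, and the label `"future"` are all assumptions made for the example.

```python
from collections import defaultdict

def aligned_spans(alignment, zh_verb_indices):
    """Map each Chinese verb index to the (start, end) English span it aligns to.

    `alignment` is assumed to be a list of (zh_index, en_index) pairs,
    as a word aligner might produce. Verbs with no alignment are skipped.
    """
    targets = defaultdict(list)
    for zh_i, en_j in alignment:
        targets[zh_i].append(en_j)
    return {
        i: (min(targets[i]), max(targets[i]))
        for i in zh_verb_indices
        if i in targets
    }

def project_labels(spans, span_labels):
    """Copy the label annotated on each English span back to its Chinese verb."""
    return {
        zh_i: span_labels[span]
        for zh_i, span in spans.items()
        if span in span_labels
    }

# Hypothetical sentence pair; the tokens and alignment are made up.
zh_tokens = ["他", "明天", "去", "北京"]
en_tokens = ["He", "will", "go", "to", "Beijing", "tomorrow"]
alignment = [(0, 0), (1, 5), (2, 1), (2, 2), (3, 4)]
zh_verb_indices = [2]  # "去" is the only verb

spans = aligned_spans(alignment, zh_verb_indices)

# Annotators label the English span "will go"; the category name is hypothetical.
span_labels = {(1, 2): "future"}
projected = project_labels(spans, span_labels)

print(spans)
print(projected)
```

In this toy case the Chinese verb carries no tense marking of its own, while the aligned English span "will go" does, which is the asymmetry the method relies on.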
Similar resources
Buy one get one free: Distant annotation of Chinese tense, event type and modality
We describe a “distant annotation” method where we mark up the semantic tense, event type, and modality of Chinese events via a word-aligned parallel corpus. We first map Chinese verbs to their English counterparts via word alignment, and then annotate the resulting English text spans with coarse-grained categories for semantic tense, event type, and modality that we believe apply to both Engli...
Automatic Inference of the Tense of Chinese Events Using Implicit Linguistic Information
We address the problem of automatically inferring the tense of events in Chinese text. We use a new corpus annotated with Chinese semantic tense information and other implicit Chinese linguistic information using a “distant annotation” method. We propose three improvements over a relatively strong baseline method – a statistical learning method with extensive feature engineering. First, we add ...
How And Where Do People Fail With Time: Temporal Reference Mapping Annotation By Chinese And English Bilinguals
This work reports on three human tense annotation experiments for Chinese verbs in Chinese-to-English translation scenarios. The results show that inter-annotator agreement increases as the context of the verb under the annotation becomes increasingly specified, i.e. as the context moves from the situation in which the target English sentence is unknown to the situation in which the target lexi...
Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation
This paper describes current efforts in developing an annotation schema and guidelines for sentences in Episodic Logic (EL). We focus on important distinctions for representing modality, attitudes, and tense and present an annotation schema that makes these distinctions. EL has proved competitive with other logical formulations in speed and inference-enablement, while expressing a wider array o...
Annotating "tense" in a Tense-less Language
In the context of Natural Language Processing, annotation is about recovering implicit information that is useful for natural language applications. In this paper we describe a “tense” annotation task for Chinese, a language that does not have grammatical tense, that is designed to infer the temporal location of a situation in relation to the temporal deixis, the moment of speech. If successful...